Causal discovery using compression-complexity measures

نویسندگان

چکیده

Causal inference is one of the most fundamental problems across all domains science. We address problem inferring a causal direction from two observed discrete symbolic sequences $X$ and $Y$. present framework which relies on lossless compressors for context-free grammars (CFGs) sequence pairs quantifies extent to grammar inferred compresses other sequence. infer causes $Y$ if better than in direction. To put this notion practice, we propose three models that use Compression-Complexity Measures (CCMs) - Lempel-Ziv (LZ) complexity Effort-To-Compress (ETC) CFGs discover directions without demanding temporal structures. evaluate these synthetic real-world benchmarks empirically observe performances competitive with current state-of-the-art methods. Lastly, unique applications proposed directly genome belonging SARS-CoV-2 virus. Using large number sequences, show our capture directed information exchange between pairs, presenting novel opportunities addressing key issues such as contact-tracing, motif discovery, evolution virulence pathogenicity future applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Causal Discovery Using A Bayesian Local Causal Discovery Algorithm

This study focused on the development and application of an efficient algorithm to induce causal relationships from observational data. The algorithm, called BLCD, is based on a causal Bayesian network framework. BLCD initially uses heuristic greedy search to derive the Markov Blanket (MB) of a node that serves as the "locality" for the identification of pair-wise causal relationships. BLCD tak...

متن کامل

Causal Discovery Using Proxy Variables

In this paper, we develop a framework to estimate the cause-effect relation between two static entities x and y: for instance, an art masterpiece x and its fraudulent copy y. To this end, we introduce the notion of proxy variables, which allow the construction of a pair of random entities (A,B) from the pair of static entities (x, y). Then, estimating the cause-effect relation between A and B u...

متن کامل

Non-linear Causal Inference using Gaussianity Measures

We provide theoretical and empirical evidence for a type of asymmetry between causes and effects that is present when these are related via linear models contaminated with additive non-Gaussian noise. Assuming that the causes and the effects have the same distribution, we show that the distribution of the residuals of a linear fit in the anti-causal direction is closer to a Gaussian than the di...

متن کامل

Low Complexity Video Compression Algorithm Using AMBTC

A computationally simple algorithm for video compression is presented in this paper. The algorithm gives excellent quality image sequence reconstruction at a peaksignal-to-noise-ratio (PSNR) of approximately 37 decibels (dB) and a very low bit rate of 0.2 bits-per-pixel (bpp). The scheme utilizes absolute moment block truncation coding (AMBTC) for intraframe coding, in conjunction with three-st...

متن کامل

Lecture 24, Causal Discovery

ly, the algorithm works as follows: • Start with a complete undirected graph on all variables. • For each pair of variables, see if conditioning on some set of variables makes them conditionally independent; if so, remove their edge. • Identify all colliders by checking for conditional dependence; orient the edges of colliders. • Try to orient undirected edges by consistency with already-orient...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Biomedical Informatics

سال: 2021

ISSN: ['1532-0480', '1532-0464']

DOI: https://doi.org/10.1016/j.jbi.2021.103724